Add cached read and writes to stats and cost calculation for LiteLLM provider #4206

mollux · 2025-06-01T19:57:30Z

Related GitHub Issue

Closes: #

Description

The current implementation of LiteLLM doesn't take into account cached tokens (both write and read), resulting in incorrect cost calculation.
This PR stores the relevant information (taking into account the Anthropic only cache_creation_input_tokens, see https://docs.litellm.ai/docs/completion/prompt_caching), and uses it during cost calculation and visualisation of the cached tokens.

Test Procedure

configure a LiteLLM instance
expose a model that supports prompt caching following the OpenAI spec (so e.g. Anthropic models, no Gemini 2.5)
configure Roo code to use the LiteLLM model
validate that the prompt stats contains cache information, and that the cached tokens correspond to the values in the LiteLLM request logs.

note: depending on the model, the cost calculation can be wrong in liteLLM (e.g. for Anthropic models), as cache calculation has some quirks there too. This doesn't change the usefulness of supporting that cache info in Roo Code

Type of Change

🐛 Bug Fix: Non-breaking change that fixes an issue.
✨ New Feature: Non-breaking change that adds functionality.
💥 Breaking Change: Fix or feature that would cause existing functionality to not work as expected.
♻️ Refactor: Code change that neither fixes a bug nor adds a feature.
💅 Style: Changes that do not affect the meaning of the code (white-space, formatting, etc.).
📚 Documentation: Updates to documentation files.
⚙️ Build/CI: Changes to the build process or CI configuration.
🧹 Chore: Other changes that don't modify src or test files.

Pre-Submission Checklist

Screenshots / Videos

Documentation Updates

Additional Notes

Get in Touch

Important

Adds cached token cost calculation to LiteLLM, updating model info and usage data reporting.

Behavior:
- Adds cached token cost calculation in LiteLLMHandler in lite-llm.ts.
- Updates getLiteLLMModels in litellm.ts to include cacheWritesPrice and cacheReadsPrice.
Cost Calculation:
- Updates calculateApiCostOpenAI usage to include cache write and read tokens in lite-llm.ts.
Usage Data:
- Adds cacheWriteTokens and cacheReadTokens to ApiStreamUsageChunk in lite-llm.ts.

^{This description was created by}^{for 3716053. You can customize this summary. It will automatically update as commits are pushed.}

mrubens

Thanks!

mrubens · 2025-06-01T20:17:42Z

Let me know if I can help getting the tests to pass

mollux · 2025-06-01T20:25:14Z

The compile error should be fixed, but I don't get the platform unit test failure.
It seems unrelated, but I may be missing something.

mollux · 2025-06-01T20:30:34Z

same failure appears on other PR's, e.g. #4206 and #4210, and seems to be introduced in 5e50c55

so probably unrelated for this PR.

mrubens · 2025-06-01T20:35:33Z

Ah ok, will try to figure it out separately. Thank you for the PR!

taylorwilsdon · 2025-06-01T20:35:58Z

I saw the mention of this in #4210 and will have it fixed in that branch if I can save you the time @mrubens (edit - fixed and all passing now)

hannesrudolph · 2025-06-21T22:12:58Z

@mollux can you please shoot me a discord DM at hrudolph?

Add cached read and writes to cost calculation for LiteLLM

3716053

mollux requested review from cte and mrubens as code owners June 1, 2025 19:57

github-project-automation bot added this to Roo Code Roadmap Jun 1, 2025

github-project-automation bot moved this to Triage in Roo Code Roadmap Jun 1, 2025

github-project-automation bot added this to Roo Code Roadmap Jun 1, 2025

github-project-automation bot moved this to New in Roo Code Roadmap Jun 1, 2025

dosubot bot added size:XS This PR changes 0-9 lines, ignoring generated files. bug Something isn't working labels Jun 1, 2025

mollux changed the title ~~Add cached read and writes to cost calculation for LiteLLM~~ Add cached read and writes to stats and cost calculation for LiteLLM provider Jun 1, 2025

mrubens approved these changes Jun 1, 2025

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Jun 1, 2025

Fixed property issue

08ac3fe

dosubot bot added size:S This PR changes 10-29 lines, ignoring generated files. and removed size:XS This PR changes 0-9 lines, ignoring generated files. labels Jun 1, 2025

mrubens merged commit dca1076 into RooCodeInc:main Jun 1, 2025
9 of 11 checks passed

github-project-automation bot moved this from Triage to Done in Roo Code Roadmap Jun 1, 2025

github-project-automation bot moved this from New to Done in Roo Code Roadmap Jun 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add cached read and writes to stats and cost calculation for LiteLLM provider #4206

Add cached read and writes to stats and cost calculation for LiteLLM provider #4206

Uh oh!

mollux commented Jun 1, 2025 •

edited by ellipsis-dev bot

Loading

Uh oh!

mrubens left a comment

Uh oh!

mrubens commented Jun 1, 2025

Uh oh!

mollux commented Jun 1, 2025

Uh oh!

mollux commented Jun 1, 2025 •

edited

Loading

Uh oh!

Uh oh!

mrubens commented Jun 1, 2025

Uh oh!

taylorwilsdon commented Jun 1, 2025 •

edited

Loading

Uh oh!

hannesrudolph commented Jun 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Add cached read and writes to stats and cost calculation for LiteLLM provider #4206

Add cached read and writes to stats and cost calculation for LiteLLM provider #4206

Uh oh!

Conversation

mollux commented Jun 1, 2025 • edited by ellipsis-dev bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Related GitHub Issue

Description

Test Procedure

Type of Change

Pre-Submission Checklist

Screenshots / Videos

Documentation Updates

Additional Notes

Get in Touch

Uh oh!

mrubens left a comment

Choose a reason for hiding this comment

Uh oh!

mrubens commented Jun 1, 2025

Uh oh!

mollux commented Jun 1, 2025

Uh oh!

mollux commented Jun 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

mrubens commented Jun 1, 2025

Uh oh!

taylorwilsdon commented Jun 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hannesrudolph commented Jun 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mollux commented Jun 1, 2025 •

edited by ellipsis-dev bot

Loading

mollux commented Jun 1, 2025 •

edited

Loading

taylorwilsdon commented Jun 1, 2025 •

edited

Loading